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We report observation of the electroweak production of single top quarks in pp collisions at 
y/s = 1.96 TeV based on 2.3 fb _1 of data collected by the DO detector at the Fermilab Tevatron 
Collider. Using events containing an isolated electron or muon and missing transverse energy, 
together with jets originating from the fragmentation of b quarks, we measure a cross section of 
o~(pp — > tb + X , tqb + X) = 3.94 ± 0.88 pb. The probability to measure a cross section at this value 
or higher in the absence of signal is 2.5 x 10 -7 , corresponding to a 5.0 standard deviation significance 
for the observation. 

PACS numbers: 14.65.Ha; 12.15.Ji; 13.85.Qk; 12.15.Hh 



At hadron colliders, top quarks can be produced 
in pairs via the strong interaction or singly via the 
electroweak interaction Top quarks were first 

observed via pair production at the Fermilab Tevatron 
Collider in 1995 [2|. Since then, pair production has 



been used to make precise measurements of several top 
quark properties, including the top quark mass Q . Single 
top quark production, on the other hand, serves as a 
probe of the Wtb interaction Q , and its production cross 
section provides a direct measurement of the magnitude 
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of the quark mixing matrix element Vtb without assuming 
three quark generations However, measuring the 

yield of single top quarks is difficult because of the small 
production rate and large backgrounds. 

In 2007, we presented the first evidence for single top 
quarkproduction and the first direct measurement of 
\Vtb\ [fii III usm g 0.9 fb _1 of Tevatron data at a center- 
of-mass energy of 1.96 TeV. Recently, the CDF collab- 
oration has also presented such evidence in 2.2 fb _1 of 
data Q . This Letter describes the observation of a single 
top quark signal in 2.3 fb _1 of Tevatron data. The CDF 
collaboration is also reporting observation of single top 
quark production Q . 

When top quarks are produced singly, they are accom- 
panied by a bottom quark in the s-channel production 
mode [10( or by both a bottom quark and a light quark 
in i-channel production 0, [ll[, as illustrated in Fig. [TJ 
We search for both of these processes at once. The sum 
of their predicted cross sections is 3.46 ± 0.18 pb 
for a top quark mass m t = 170 GeV, at which this 
analysis is performed. We refer to the s-channel process 
as "tb" production, where tb includes both tb and ib 
states. The i-channel process is abbreviated as u tqb," 
where this includes tqb, tqb, tqb, and tqb states. 





FIG. 1: Representative Feynman diagrams for (a) s-channel 
single top quark production and (b) t-channel production, 
showing the top quark decays of interest. 

The analysis presented in this Letter is an improved 
version of the one from 2007 0, 0], with a larger 
dataset. Most definitions and abbreviations used here 
are explained in detail in Ref. Q. The measurement 
focuses on the final state containing one high trans- 
verse momentum (jPt) lepton (£ = electron or muon) not 
near a jet ("isolated"), large missing transverse energy 
($t) indicative of the passage of a neutrino v, a b- 
quark jet from the decay of the top quark (t~^Wb^£vb), 
and possibly another b jet and a light jet as indicated 
above. The data were collected with the DO detector [l3j 
using a logical OR of many trigger conditions in place of 
only the singlc-lcpton plus jets triggers used previously. 
Several offline selection criteria, including fe-jet identifi- 
cation requirements for double-tagged events, have been 
loosened. These improvements have increased the signal 
acceptance by 18%. The backgrounds are W bosons 
produced in association with jets, top quark pair (ti) 
production with decay into the lepton+jets and dilcpton 
final states (when a jet or a lepton is not reconstructed), 



and multijet production, where a jet is misreconstructed 
as an electron or a heavy-flavor quark decays to a 
muon that passes isolation criteria. Z+jets and diboson 
processes form minor additional background components. 

We consider events with two, three, or four jets (which 
allows for additional jets from initial-state and final-state 
radiation), reconstructed using a cone algorithm in (y, <p) 
space, where y is the rapidity and <t> is the azimuthal 
angle, and the cone radius is 0.5 [7j. The highest-pr 
(leading) jet must have pr > 25 GeV, and subsequent jets 
havepT > 15 GeV; all jets have pseudorapidity \r)\ < 3.4. 
We require 20 < $t < 200 GeV for events with two jets 
and 25 < $t < 200 GeV for events with three or four 
jets. Events must contain only one isolated electron with 
p T > 15 GeV and \rj\ < 1.1 (p T > 20 GeV for three- or 
four-jet events), or one isolated muon withp^ > 15 GeV 
and \r]\ < 2.0. The background from multijets events is 
kept to «5% by requiring high total transverse energy 
and by demanding that the $t is not along the direction 
of the lepton or the leading jet. To enhance the signal 
fraction, one or two of the jets are required to originate 
from long-lived b hadrons. We achieve this goal by using 
a neural network (NN) 6-jet tagging algorithm 14(. The 
variables used to identify such jets rely on the character- 
istics of a secondary vertex and tracks with large impact 
parameters. After fr-jct identification, we require the 
leading 6-tagged jet to have pr > 20 GeV. To further 
improve the sensitivity, we split the data by lepton flavor, 
number of jets and 6-tagged jets, and data collection 
period. 

We model the signal using the COMPHEP-based 
next-to-leading order (NLO) Monte Carlo (MC) event 
generator SINGLETOP [l5| . The decays of the top quark 
and resulting W boson, both with standard model (SM) 
widths, are modeled in SINGLETOP to preserve spin infor- 
mation. PYTHIA [l(| is used to model the hadronization 
of generated partons. We assume the SM prediction for 
the ratio of the tb and tqb cross sections [12j. 

The ti, TF+jets, and Z+jets backgrounds are 
simulated using the ALPGEN leading-log MC event 
generator [13] and pythia to model hadronization. The 
ti background is normalized to the predicted cross 
section [18f . The diboson backgrounds are modeled using 
pythia. In the simulation of the VF+jets backgrounds, 
we scale the ALPGEN cross sections for events with heavy 
flavor jets by factors derived from calculations of NLO 
effects (T^j: Wbb and Wcc are scaled by 1.47, and Wcj 
by 1.38. 

All MC events are passed through a GEANT-based 
simulation of the DO detector and are reconstructed using 
the same software as for the data. Data events from 
random beam crossings are overlaid on the simulation to 
better model the effects of detector noise and multiple 
pp interactions. Small differences between data and 
simulation in the lepton and jet reconstruction efficiencies 
and resolutions are corrected in the simulation as 
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measured from separate data samples. We also correct 
the 7/(jets), A0(jetl, jet2), and A?7(jetl, jet2) distribu- 
tions in the JT+jets samples to match data. 

The multijets background is modeled using 
independent data samples containing leptons that 
are not isolated. The multijets background, combined 
with the background from VT+jets, is normalized to the 
lcpton+jets data with other backgrounds subtracted, 
using the prW, ^t, and the W boson transverse mass 
distributions before 6-jet identification is applied. 

The 6-tagging algorithm is modeled in simulated events 
by applying weights ("tag-rate functions") measured 
from data that account for the probability for each jet 
to be tagged as a function of jet flavor, pt, and 77. After 
b tagging, an empirical correction of 0.95 ± 0.13 for the 
Wbb and Wcc fractions is derived from the 6-tagged and 
not-&-tagged two-jet data and simulated samples. 

The above selections give 4,519 6-tagged lepton+jets 
events, which are expected to contain 223 ± 30 single top 
quark events. Table H] shows the event yields, separated 
by jet multiplicity. The acceptances are (3.7 ±0.5)% for 
tb and (2.5 ± 0.3)% for tqb, expressed as percentages of 
the inclusive single top quark production cross section in 
each channel. 

TABLE I: Number of expected and observed events in 2.3 fb _1 
for e and fj,, and 1 and 2 fe-tagged analysis channels combined. 
The uncertainties include both statistical and systematic 
components. 



Source 


2 jets 


3 jets 


4 jets 


tb+tqb signal 


139 ± 18 


63 ±10 


21 ±5 


W+jets 


1,829 ± 161 


637 ±61 


180 ±18 


Z+jets and dibosons 


229 ± 38 


85 ±17 


26 ±7 


ti 


222 ± 35 


436 ± 66 


484 ±71 


Multijets 


196 ± 50 


73 ±17 


30 ±6 


Total prediction 


2,615 ± 192 


1,294 ±107 


742 ± 80 


Data 


2,579 


1,216 


724 



Systematic uncertainties arise from each correction 
factor or function applied to the background and signal 
models. Most affect only the normalization, but three 
corrections modify in addition the shapes of the distri- 
butions; these are the jet energy scale corrections, the 
tag-rate functions, and the reweighting of the distribu- 
tions in W+jets events. The largest uncertainties come 
from the jet energy scale (the normalization part is (1.1- 
13.1)% for signal and (0.1-2.1)% for background), the 
tag-rate functions (the normalization part is (2.1-7.0)% 
for single-tagged events and (9.0-11.4)% for double- 
tagged events), and the correction for jet-flavor compo- 
sition in IT+jets events (13.7%), with smaller contribu- 
tions from the integrated luminosity (6.1%), jet energy 
resolution (4.0%), initial-state and final-state radiation 
(0.6-12.6%), 6- jet fragmentation (2.0%), tt cross section 
(12.7%), and lepton efficiency corrections (2.5%). All 



other contributions have a smaller effect. The values 
given are the relative uncertainties on the individual 
sources. The total uncertainty on the background is (8- 
16)% depending on the analysis channel. 

After event selection, we expect single top quark 
events to constitute (3-9)% of the data sample. Since 
the uncertainty on the background is larger than the 
expected signal, we improve discrimination by using 
multivariate analysis techniques. We have developed 
three independent analyses based on boosted decision 
trees (BDT) [2(J, Bayesian neural networks (BNN) [2l[, 
and the matrix element (ME) method [13]. Our appli- 
cation of these techniques to DO's single top quark 
searches is described in Refs. [f| and 0]. The analyses 
presented in this Letter differ from previous implementa- 
tions in the choice of input variables and some detailed 
tuning of each technique. 

The BDT analysis has re-optimized the input 
variables [23| into a common set of 64 variables for all 
analysis channels. The variables fall into five categories, 
single-object kinematics, global event kinematics, jet 
reconstruction, top quark reconstruction, and angular 
correlations. Separate sets of trees are created with these 
variables for each channel. The BNN analysis uses the 
RuleFitJF algorithm 
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to select the most sensitive 
of these variables, then combines 18-28 of them into a 
single separate discriminant for each channel. The ME 
analysis uses only two-jet and three-jet events, divided 
into a W+jets-dominated set and a tt-dominated set. It 
includes matrix elements for more background sources, 
adding ti, WW, WZ, and ggg diagrams in the two-jet bin 
and Wugg in the three-jet bin, to improve background 
rejection. 

Each analysis uses the same data and background 
model and has the same sources of systematic uncer- 
tainty. We test the analyses using ensembles of pseudo- 
datasets created from background and signal at different 
cross sections to confirm linear behavior and thus an 
unbiased cross section measurement. The analyses are 
also checked extensively before 6-tagging is applied, and 
using two control regions of the data, one dominated by 
W+jets and the other by ti backgrounds, as shown in 
Fig. O These studies confirm that backgrounds are well 
modeled across the full range of the discriminant output. 

The cross section is determined using the same 
Bayesian approach as in our previous studies 0, @] • This 
involves forming a binned likelihood as a product over all 
bins and channels, evaluated separately for each multi- 
variate discriminant, with no cuts applied to the outputs. 
The central value of the cross section is defined by the 
position of the peak in the posterior density, and the 
68% interval about the peak is taken as the uncertainty 
on the measurement. Systematic uncertainties, including 
all correlations, are reflected in this posterior interval. 

We extract inclusive single top quark cross sections 
a(pp -> tb + X,tqb + X) of (Xbdt = 3.74+°;^ pb, er B NN = 
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(a) Final Discriminant 
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FIG. 2: The combination discriminant outputs for (a) VF+jets 
and (b) ti cross-check samples. Ht is the scalar sum of the 
transverse momenta of the final state objects (lepton, $t, and 
jets). 
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of the analyses to a contribution from single top quark 
production is estimated by generating an ensemble of 
pseudodatasets that sample the background model and 
its uncertainties, with no signal present. We measure a 
cross section from each pseudodataset, and hence obtain 
the probability that the SM cross section is reached. 
This provides expected sensitivities (stated in terms of 
Gaussian standard deviations, SD) of 4.3, 4.1, and 4.1 SD 
for the BDT, BNN, and ME analyses respectively. The 
measured significances, obtained by counting the number 
of pseudodatasets with cross sections at least as large 
as the measured cross section, are 4.6, 5.2, and 4.9 SD 
respectively. 

The three multivariate techniques use the same data 
sample but arc not completely correlated: the corre- 
lation of the measured cross section using pseudodatasets 
with background and SM signal is BDT:BNN = 74%, 
BDT:ME = 60%, BNN:ME = 57%. Their combination 
therefore leads to increased sensitivity and a more precise 
measurement of the cross section. We use the three 
discriminant outputs as inputs to a second set of Bayesian 
neural networks, and obtain the combined cross section 
and its signal significance from the new discriminant 
output. The resulting expected significance is 4.5 SD. 
Figure [3] illustrates the importance of the signal when 
comparing data to prediction. 

The measured cross section is 

<j(pp -» tb + X, tqb + X) = 3.94 ± 0.88 pb. 

The measurement has a p- value of 2.5 x 10 -7 , corre- 
sponding to a significance of 5.0 SD. The expected and 
measured posterior densities and the background-only 
pseudodataset measurements are shown in Fig. 2J 

We use the cross section measurement to determine 
the Bayesian posterior for \Vtb\ 2 in the interval [0,1] and 
extract a limit of \V t b\ > 0.78 at 95% C.L. within the 
SM [3]. When the upper constraint is removed, we 
measure \Vtbft\ = 1-07 ± 0.12, where /f is the strength 
of the left-handed Wtb coupling. 

In summary, we have measured the single top quark 
production cross section using 2.3 fb -1 of data at the DO 
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FIG. 3: Distribution of the combination output for all 24 
analysis channels combined, (a) full range, and (b) high 
signal region. The bins have been ordered by their expected 
signahbackground ratio and the signal is normalized to the 
measured cross section. The hatched band indicates the total 
uncertainty on the background. These distributions are not 
used in the cross section measurement and are for illustration 
only. For the ranked combination output > 0.92, (c) shows 
the distribution of lepton charge times pseudorapidity of the 
leading not-6-tagged jet, and (d) shows the reconstructed top 
quark mass. 
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FIG. 4: (a) Expected SM and measured Bayesian posterior 
probability densities for the tb+tqb cross section. The shaded 
regions indicate one standard deviation above and below 
the peak positions, (b) Measured cross sections using the 
ensemble of background-only pseudodatasets (containing full 
systematics and no signal) used to measured the significance 
of the result. 



experiment. We measure a cross section for the combined 
tb+tqb channels of 3.94 ± 0.88 pb. Our result provides 
an improved direct measurement of the amplitude of 
the CKM matrix element Vtb- The measured single top 
quark signal corresponds to an excess over the predicted 
background with a significance of 5.0 SD — observation 
of single top quark production. 
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